PSNet: Parallel Symmetric Network for Video Salient Object Detection
نویسندگان
چکیده
For the video salient object detection (VSOD) task, how to excavate information from appearance modality and motion has always been a topic of great concern. The two-stream structure, including an RGB stream optical flow stream, widely used as typical pipeline for VSOD tasks, but existing methods usually only use features unidirectionally guide or adaptively blindly fuse two features. However, these underperform in diverse scenarios due uncomprehensive unspecific learning schemes. In this paper, following more secure modeling philosophy, we deeply investigate importance comprehensive way propose network with up down parallel symmetry, named PSNet. Two branches different dominant modalities are set achieve complete saliency decoding cooperation Gather Diffusion Reinforcement (GDR) module Cross-modality Refinement Complement (CRC) module. Finally, Importance Perception Fusion (IPF) according their scenarios. Experiments on four dataset benchmarks demonstrate that our method achieves desirable competitive performance.
منابع مشابه
MSDNN: Multi-Scale Deep Neural Network for Salient Object Detection
Salient object detection is a fundamental problem and has been received a great deal of attentions in computer vision. Recently deep learning model became a powerful tool for image feature extraction. In this paper, we propose a multi-scale deep neural network (MSDNN) for salient object detection. The proposed model first extracts global high-level features and context information over the whol...
متن کاملEfficient Co-Salient Video Object Detection Based on Preattentive Processing
Automatic video annotation is a critical step for contentbased video retrieval and browsing. Detecting the focus of interest such as co-occurring objects in video frames automatically can benefit the tedious manual labeling process. However, detecting the co-occurring objects that is visually salient in video sequences is a challenging task. In this paper, in order to detect co-salient video ob...
متن کاملVideo Salient Object Detection Using Spatiotemporal Deep Features
This paper presents a method for detecting salient objects in videos where temporal information in addition to spatial information is fully taken into account. Following recent reports on the advantage of deep features over conventional handcrafted features, we propose the SpatioTemporal Deep (STD) feature that utilizes local and global contexts over frames. We also propose the SpatioTemporal C...
متن کاملImpression Network for Video Object Detection
Video object detection is more challenging compared to image object detection. Previous works proved that applying object detector frame by frame is not only slow but also inaccurate. Visual clues get weakened by defocus and motion blur, causing failure on corresponding frames. Multiframe feature fusion methods proved effective in improving the accuracy, but they dramatically sacrifice the spee...
متن کاملSalient Object Detection using a Context-Aware Refinement Network
Recently there has been remarkable success in pushing the state of the art in salient object detection. Most of the improvements are driven by employing end-to-end deeper feed-forward networks. However, in many cases precisely detecting salient regions requires representation of fine details. Combining high-level and low-level features using skip connections is a strategy that has been proposed...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE transactions on emerging topics in computational intelligence
سال: 2023
ISSN: ['2471-285X']
DOI: https://doi.org/10.1109/tetci.2022.3220250